Chemical Structure Reconstruction with chemoCR

Author

  • Marc Zimmermann
Abstract

chemoCR makes the chemical information contained in depictions of chemical structures accessible to computer programs as a connection table. To recognize and translate chemical structures in image documents, the chemoCR system combines pattern recognition techniques with supervised machine learning. The method is based on identifying the most significant semantic entities in structural formulas, for example chiral bonds, superatoms, and reaction arrows. The workflow consists of three phases: image preprocessing, semantic entity recognition, and molecule reconstruction with validation of the result. Every step of the process uses chemical knowledge to detect and fix errors. The system can be trained and adapted to different sources of input images. The reconstructed connection table can be used by any chemical software that reads connection tables.

Figure 1: The drug azithromycin drawn with two different structure editors. The drawn molecules are identical, but the images differ considerably.
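To make the term "connection table" and the idea of validation with chemical knowledge concrete, here is a minimal Python sketch. It is not chemoCR's implementation: the `ConnectionTable` class, the `MAX_VALENCE` lookup, and the `valence_errors` check are hypothetical names introduced only for illustration, and the valence rule is a deliberately simplified stand-in for whatever checks the system actually performs.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Typical maximum valences for a few common elements.
# Illustrative assumption only; real validation needs charges, aromaticity, etc.
MAX_VALENCE = {"H": 1, "C": 4, "N": 3, "O": 2}


@dataclass
class ConnectionTable:
    """Hypothetical minimal connection table: atoms plus bonds between atom indices."""

    atoms: list[str] = field(default_factory=list)  # element symbol per atom index
    bonds: list[tuple[int, int, int]] = field(default_factory=list)  # (i, j, bond order)

    def add_atom(self, symbol: str) -> int:
        """Append an atom and return its index."""
        self.atoms.append(symbol)
        return len(self.atoms) - 1

    def add_bond(self, i: int, j: int, order: int = 1) -> None:
        """Record a bond of the given order between atoms i and j."""
        self.bonds.append((i, j, order))

    def valence_errors(self) -> list[int]:
        """Return indices of atoms whose summed bond orders exceed the typical valence."""
        used = [0] * len(self.atoms)
        for i, j, order in self.bonds:
            used[i] += order
            used[j] += order
        # Elements missing from the lookup are never flagged by this sketch.
        return [
            k
            for k, symbol in enumerate(self.atoms)
            if symbol in MAX_VALENCE and used[k] > MAX_VALENCE[symbol]
        ]


# A plausible recognition result: a carbonyl group C=O (hydrogens omitted).
good = ConnectionTable()
c = good.add_atom("C")
o = good.add_atom("O")
good.add_bond(c, o, 2)

# A suspicious result: an oxygen with three single bonds, e.g. from a misread line.
bad = ConnectionTable()
center = bad.add_atom("O")
for _ in range(3):
    bad.add_bond(center, bad.add_atom("C"))

print(good.valence_errors())
print(bad.valence_errors())
```

In a pipeline like the one described, a flagged atom index would point back to the image region where a bond or atom label was probably misrecognized, which is one way chemical knowledge can be used to detect errors.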

Publication date: 2011